ViDeDup: An Application-Aware Framework for Video De-duplication
نویسندگان
چکیده
Key to the compression-capability of a data deduplication system is the definition of redundancy. Traditionally, two data items are considered redundant if their underlying bit-streams are identical. However, this notion of redundancy is too strict for many applications. For example, for a video storage platform, two videos encoded in different formats would be unique at the system level but redundant at the content level. Intuitively, introducing application-level intelligence in redundancy detection can yield improved data compression. We propose ViDeDup (Video De-Duplication), a novel framework for video de-duplication based on an application-level view of redundancy. The framework goes beyond duplicate data detection to similarity-detection, thereby providing application-level knobs for defining acceptable level of noise during replica detection. Our results show that by trading CPU for storage, a 45% reduction in storage space could be achieved, in comparison to 8% yielded by system level de-duplication for a dataset collected from video sharing sites on the Web. We also present tradeoff analysis for various tunable parameters of the system to optimally tune the system for performance, compression and quality.
منابع مشابه
Green Energy-aware task scheduling using the DVFS technique in Cloud Computing
Nowdays, energy consumption as a critical issue in distributed computing systems with high performance has become so green computing tries to energy consumption, carbon footprint and CO2 emissions in high performance computing systems (HPCs) such as clusters, Grid and Cloud that a large number of parallel. Reducing energy consumption for high end computing can bring various benefits such as red...
متن کاملDouble Cervix with Normal Uterus and Vagina - An Unclassified Müllerian Anomaly
Müllerian anomalies are very common, and a frequent cause of infertility. The most used classification system until now, proposed by the American Society for Reproductive Medicine in 1988, categorizes comprehensively uterine anomalies but fails to classify defects of the cervix or vagina. This is based on a developmental theory that postulates that müllerian duct fusion is unidirectional, begin...
متن کاملDe novo duplication 3q in an infant with a vascular ring and features overlapping Cornelia de Lange phenotype
Partial duplication of chromosome 3q is a recognizable syndrome with characteristic facial features, microcephaly, digital anomalies, genitourinary and cardiac defects as well as growth retardation and developmental delays. While there is clinical overlap with the unrelated Cornelia de Lange syndrome (CDLS), there are distinguishing features and molecular etiologies. Most cases of 3q duplicatio...
متن کاملAnalyzing Compute vs. Storage Tradeoff for Video-aware Storage Efficiency
Video content is quite unique from its storage footprint perspective. In a video distribution environment, a master video file needs to be transcoded into different resolutions, bitrates, codecs and containers to enable distribution to a wide variety of devices and media players over different kinds of networks. Our experiments show that when 8 master videos are transcoded into most popular 376...
متن کاملSEEC: A Framework for Self-aware Computing
As the complexity of computing systems increases, application programmers must be experts in their application domain and have the systems knowledge required to address the problems that arise from parallelism, power, energy, and reliability concerns. One approach to relieving this burden is to make use of self-aware computing systems, which automatically adjust their behavior to help applicati...
متن کامل